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Abstract —Endowed with context-awareness and proactive ca¬ 
pabilities, caching users’ content locally at the edge of the 
network is able to cope with increasing data traffic demand 
in 5G wireless networks. In this work, we focus on the energy 
consumption aspects of cache-enabled wireless cellular networks, 
specifically in terms of area power consumption (APC) and 
energy efficiency (EE). We assume that both base stations (BSs) 
and mobile users are distributed according to homogeneous 
Poisson point processes (PPPs) and we introduce a detailed power 
model that takes into account caching. We study the conditions 
under which the area power consumption is minimized with 
respect to BS transmit power, while ensuring a certain quality 
of service (QoS) in terms of coverage probability. Eurthermore, 
we provide the optimal BS transmit power that maximizes the 
area spectral efficiency per unit total power spent. The main 
takeaway of this paper is that caching seems to be an energy 
efficient solution. 

Index Terms —5G, mobile wireless networks, caching, energy 
efficiency, area power consumption, stochastic geometry 

I. Introduction 

Fueled by the ubiquity of new wireless devices and smart 
phones, as well as the proliferation of bandwidth-intensive ap¬ 
plications, user demand for wireless data traffic has been grow¬ 
ing tremendously, increasing the corresponding network load 
in an exponential manner. This is further exacerbated by rich 
media applications associated with video streaming and social 
networking. In parallel, both academia and industry are now 
in the urge of evolving traditional cellular networks towards 
the next-generation broadband mobile networks, coined as 5G 
networks, targeting to satisfy the mobile data tsunami while 
minimizing expenditures and energy consumption. Among 
these intensive efforts, caching users’ content locally at the 
edge of the network is considered as one of the most disruptive 
paradigms in 5G networks Q. 

Interestingly yet not surprising, recent results have shown 
that distributed content caching can significantly offload dif¬ 
ferent parts of the network, such as in radio access networks 
(RANs) and core network (CN), by smartly prefetching and 
storing content closer to the end-users 0. Indeed, traditional 
cellular networks, which have been designed for mobile de¬ 
vices with limited processing and storage capabilities, have 
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started incorporating context-aware and proactive capabilities, 
fueled by recent advancements in processing power and stor¬ 
age. As a result, caching has recently taken the 5G literature 
by storm. 

The idea of caching at the edge of the network, namely at 
the level of base stations (BSs) and user terminals (UTs), have 
been highlighted in various works, including edge caching 
1^, FemtoCaching Q, and proactive caching Q. Spatially 
random distributed cache-enabled BSs are modeled in 0, 
and expressions for the outage probability and average de¬ 
livery rate are derived therein. Another stochastic framework 
but in the context of cache-enabled device-to-device (D2D) 
communications is presented in studying performance 
metrics that quantify the local and global fraction of served 
content requests. Although most prior work (see 0 for a 
recent survey) deals with different aspects of caching, i.e. 
performance characterization and approximate algorithms, the 
energy consumption behavior of cache-enabled BSs in densely 
deployed scenarios has not been investigated. This is precisely 
the focus of this paper. 

Energy consumption aspects of cellular networks are usually 
investigated by placing BSs on a regular hexagonal or grid 
topology and conducting intensive system-level simulations 
10. Therein, the aim is to hnd the optimal values of pa¬ 
rameters, such as cell range and BS transmit power, while 
minimizing the total power consumed under certain quality- 
of-service (QoS) constraints. Despite its attractiveness, it is 
rather cumbersome or even impossible to analytically evaluate 
key performance metrics in large-scale networks. A common 
simplihcation in modeling cellular networks is to place the 
BSs on a two dimensional plane according to a homogeneous 
spatial Poisson point process (PPP), enabling to handle the 
problem analytically | |T0| . Several works studied the validity 
of PPP modeling of BSs compared to regular cellular models 
(e.g. d)) and additional insights can be obtained by analyt¬ 
ically characterizing performance metrics, such as the spatial 
distribution of signal-to-interference-plus-noise ratio (SINR), 
coverage probability, and average rate o. 

In this work, we analyze energy consumption aspects of 
cache-enabled wireless network deployments using a spatial 
model based on stochastic geometry, which to the best of our 
knowledge has not been addressed in prior works. We consider 


the problem of area power consumption (APC) and energy 
efficiency (EE) of cache-enabled BSs in a scenario where 
BSs and mobile users are distributed according to independent 
homogeneous PPPs. We provide conditions under which the 
area power consumption is minimized subject to a certain 
quality of service (QoS) in terms of coverage probability. 
Eurthermore, we provide the optimal BS transmit power that 
maximizes the energy efficiency, defined as the ratio of area 
spectral efficiency over the total power consumed. The main 
result of this paper is that distributed content caching in 
wireless cellular networks turns out to be an energy efficient 
solution. 


II. System Model 

We consider the downlink scenario of a single-tier cellu¬ 
lar network in which the BSs are distributed on the two- 
dimensional Euclidean plane according to a homogeneous 
spatial PPP with density Ab denoted by $b = {r-ijigN, where 
ri S is the location of the i-th BS. The mobile UTs 
(or users) are modeled to be distributed on the same plane 
according to an independent homogeneous PPP with density 
Xu > Xb, which is denoted by with sj G 

being the location of the j-th user. Without loss of generality, 
we focus on a typical UT placed at the origin of the coordinate 
system for calculating the key performance metrics of interest. 

In this system model, we assume that the signal transmitted 
from a given BS is subject to two propagation phenomena 
before reaching a user: (i) a distance dependent pathloss 
governed by the pathloss function g{r) = br~°', where b 
is the pathloss coefficient and a is the pathloss exponent, 
and (ii) Rayleigh fading with mean 1. Therefore, the signal 
strength from the i-th BS as received by the typical user can 
be expressed as 

P^ivi) = h,Pbr~'^, ( 1 ) 


where the random variable hi denotes the power of Rayleigh 
fading and P is the transmitted power. In addition, we assume 
that background noise is present in the system, with variance 
= f3Xb, with (3 = where B is the total available 

bandwidth, F is the receiver noise figure, k is Boltzmann 
constant, and T is the ambient temperature. 

As alluded earlier, the typical user connected to the i-th BS 
receives a signal of power Pi{ri). This in turn means that the 
sum of the received powers from the rest of the BSs contributes 
to the interference to this signal. As a result, the received SINR 
at the typical user is given by 


SINR = 


h^g{r^)P 
cr'^ + h 


( 2 ) 


where Ii = J2r G<t>b\r Pi cumulative interference 
experienced from all the BSs except the i-th BS. 

In a downlink scenario, although the typical user may 
technically be served from any BS, a connection with a 
particular BS has to be established according to a association 
policy that satisfies a certain performance metric, e.g. ensuring 
QoS. In this work, we assume that a mobile user connects to 


the BS that provides the maximum SINR. This is formally 
expressed as 

max SINR(i) > 7 , 
nS'i't 

where 7 is the target SINR. Given the above definition, the 
typical user is said to be covered when there is at least one 
BS that offers an SINR > 7 . If not, we say that the typical 
user is not covered. We assume that 7 > 1, which is needed 
to ensure that there is at maximum one BS that provides the 
highest SINR for a user at a given instant 0 . 


A. Coverage and Quality of Service 

Among the main objectives of wireless networks is to guar¬ 
antee a certain QoS for the users. The choice of a specific QoS 
metric influences both the complexity and the operation of the 
network. In this paper, we consider the coverage probability 
as the QoS that has to be assured. The coverage probability 
for strongest BS association for a general pathloss function 
glr) is given as fli), p3) 


Pcov{P, Xb) 


where 


P[SINR > 7 ] 

/*oo 

nXb / exy>{-q{P,Xb,r))dr, 
Jo 


(3) 


q{P,Xb,r) 


PgiVr) ^ Jo giVr) + ig{y/n) 


(4) 


Denoting by P^^ the coverage probability in the case of no 
noise (cr^ — 0 ), the expression for the coverage probability in 
0 may be approximated in a low noise regime as 0 

Pcov(P, Xb) = pr (^1 - ^, (5) 

= b[C(a^ft (^)' 

The specific QoS constraint we consider here is that the 

coverage probability experienced by a typical user has to be 
always greater than the coverage probability achieved in the 
absence of noise, i.e. 


Pcov>Peo^ (6) 

As a consequence, it can be shown (from Q and that 

the optimal BS density A^ that guarantees the above QoS 

constraint is 

a: > (7) 

P “-2 

, 2 

with A = A “- 2 . We use this expression in the optimization 
problem we formulate in Section m 

Once the typical user is covered with a certain SINR, the 
traffic requests of this typical user have to be satisfied from 
its BS, by bringing the content from its source on the Internet 
via the backhaul. In practice, even though the typical user is 
covered and benefits from high SINR, it is obvious that any 
kind of bottleneck in the backhaul may result into long delays 
to the content, degrading the overall quality of experience 
(QoE). As part of dealing with this bottleneck, we assume 








that the BSs are able to store the users’ (popular) content in 
their caches, so that requests can be satisfied locally, without 
passing over the limited backhaul. This is detailed in the 
following section. 


B. Cache-enabled Base Stations 


Several studies have shown that multiple users actually 
access the same content very frequently. Take for instance 
some popular TV shows, the case of viral videos with over 
a billion viewings, news blogs, online streaming, etc. In 
this context, the network will be inundated with requests 
for the same content that might largely increase the latency 
or, eventually, congest the network itself. Otherwise stated, 
certain types of content (or information) are relatively more 
popular than others and are requested/accessed more often 
by the users GD- Therefore, it is reasonable to assume that 
a user’s choice distribution matches with the global content 
popularity distribution. As mentioned before, the logic behind 
having cache-enabled BSs is to exploit this likelihood and 
store locally at the BSs serving a typical user the content 
with highest demand (popular demand) so that both users and 
service providers get an incentive when a popular request is 
made. 

Let us assume that each BS is equipped with a storage unit 
(hard disk) which caches popular content. Since the storage 
capacity cannot be infinite, we assume that at each BS a set 
of content up to /o (the catalog) is stored on the hard disk. 
Rather than caching uniformly at random, a smarter approach 
will be to store the most popular content according to the given 
global content popularity statistics. We model the content 
popularity distribution at a typical user to be a right continuous 
and monotonically decreasing probability distribution function 
(PDF), denoted as m 


/pop if, v) 


(^- 1 )/-", /> 1 , 

0 , /<!, 


( 8 ) 


where / is a point in the support of the corresponding content, 
and r] represents the steepness of the popularity distribution 
curve. We define the steepness factor to be the (average) 
number of users per BS, that is p = ^. The justification for 
the above model is that the higher the number of users attached 
to a BS, the more accurately the trend is sampled, hence 
the more content is sorted towards the left of the distribution 
thereby making it steeper. Moreover, since A„ > Xb, we have 
that p > 1 . 

Now, given the fact that BSs cache the catalog according to 
the content popularity distribution in (j^, the probability that a 
content demanded by a connected user falls within the range 

[ fpopif,v)df 

JO 
ffo 

/ {r,-l)f-^df 

Jo 

1 - (9) 


[ 0 , /o] is given by 

Phil = 


It can be verified that Phit converges to 1 when /q —>■ oo, 
namely when the catalog stored in the BSs goes to infinity. 
Consequently, the probability that a request is missing from 
the catalog can be expressed as Pmiss = An illustration 

of the system model is given in Fig. [T] including snapshots of 
PPPs and visualization of the content popularity distribution. 
In the following, we introduce a power model which takes 
into account the caching capabilities at BSs, and will be 
used for investigating the energy aspects of cache-enabled BS 
deployment. 

C. Power Consumption Model 

In the context of studying caching at the edge, there is 
a large scope for adopting an extensive power model that 
takes into account various detailed factors. In this paper, we 
deliberately restrict ourselves to a basic power model as a first 
attempt to relate energy efficiency to a cache enabled wireless 
network. We address a more complete model in an extended 
version of this paper. We consider two different power models 
depending on whether BSs have caching capabilities or not. 

1) With caching: The components of the total power con¬ 
sumed at an operating BS is given as follows: 

1) A constant transmit power, P. 

ii) An operational charge at each BS, Pq- 

iii) Power needed to retrieve data from the local hard disk 
when a content from the catalog is requested, Phd- 

iv) Power needed to retrieve data from the backhaul when a 
content outside the catalog is requested, Pbh- 

We assume that Pbh > Phd motivated by the realistic constraint 
that it is more power consuming to utilize the backhaul 
connection than to retrieve stored information from the local 
caching/storage entity. 

Therefore, the total power consumed at a given BS is sum 
of all the components such as 

Ptot — P + Po + Pbh X Pmiss + Phd X Phit 

= P + Po + Phd + (Pbh - PM)fo~^ 

= P + Ps + Pd/o”", (10) 

where P^ = P^ -|- Phj and Pd = Pbh - Phd- 

2) Without caching: In the absence of caching, the BS 
has to retrieve the requested content from the backhaul every 
service timeslot. This is equivalent to the case where Phh = 0 
(or Pmiss = 1)- The total power consumed at a given BS is 
given as 

Ptot = P + Po + Pbh 

= P + P, + Pd. (11) 

III. Area Power Consumption 

The power expenditure per unit area, also termed as APC, 
is an important metric to characterize the deployment and 
operating costs of BSs, also indicating the compatibility of 
the system with the legal regulations. In our system model, 
the APC for cache-enabled BSs is defined as 

= AbP/ot 


( 12 ) 



Figure 1; An illustration of the considered system model. The snapshots of PPPs for i) mobile users and ii) base stations on 
unit area are given on the left side. The content popularity distribution for different values of rj is given on the top right side, 
showing that lower values of p corresponds to a more uniform behavior. 


In the same way, the APC of BSs with no caching capabilities 
is defined asV = AbTlot- In this section, we aim at minimizing 
separately the APC for both cases (caching and no caching), 
while satisfying a certain QoS. This can be formally written 
as 

minimize 'P{P) or 

PG[0,oo) ^22) 

subject to Pcov(^’, At,) > P^^. 


Consider first the case where BSs have caching capabilities. 
The following result can be obtained for the solution of the 
optimization problem in ([T3]l. 


Proposition 1. Suppose that BSs have caching capabilities, 
thus V^‘^^{P) is the objective (utility) function in {13\. Then, 
for a = 4 + e, e > 0, the optimal power allocation P* that 
solves (IZ3 is lower bounded as 


2P 

p* > _^ 


e 


(14) 


Proof: Using the expression for optimum AJ from 0 and 
incorporating it in ( [T^ , we get the expression for APC as 

= ^^iP + Ps + Pafo~^''^^)- ( 15 ) 


Let e be a real number and write a = 4 + e. 


'pP) 


A , 1 _Ai£p 2 + c 

^(P + P, + Pd/o " 

P 2 + e 

^(pi-sfr -yp^p-ih +PdP' 


P2+E 

)• 


(16) 


For e < 0, is a monotonically decreasing function and 
no minimum point exists. However, for e > 0 (i.e. a > 4), the 
first term in ( [T6l l dominates as P oo, indicating that there 
exists a minimum where the derivative of the curve changes its 
sign from negative to positive. We set e > 0 for what follows. 


Differentiating with respect to P we get 


P- 


1±1 p — 


Att-P 


10 


dP 


c + 2 
2 

Ai, P e+2 


A(Pe - 2PQ/o ^ - 2A„/oPd log(/o)P^ - 2A/oPd. 

(17) 

Given the fact that P is always positive, the derivative in 0 
remains negative as P increases from 0 until it is sufficiently 
greater than —, after which the derivative can change its sign 
to positive. This indicates that there exists a minimum value 
for P^^'> when 

2P 

P* > —, (18) 

e 

which concludes the proof. ■ 

For the case where BSs have no caching capabilities, the 
following result is derived. 

Proposition 2. Suppose that the BSs have no caching capa¬ 
bilities, thus P(P) is the objective function in ( |75| ). Then, for 
a = 4 + e, e > 0, the optimal power allocation P* that solves 
( |73] | is given by 

^ 2(Pjr + Pd) ^ 

e 

Proof: In the case without caching, by similar treatment 
as in the proof of Proposition [T] we write the expression for 

A 


APC as 


V=^[P + P, + Pa). 

P2+e 


( 20 ) 


It can be noticed that P is a monotonically decreasing function 
and has no minimum except when e > 0 (or a > 4). 
Differentiating V with respect to P we get 


dP 

dP 


AP- 


■SI(-2Pd-2P, 


Pe) 


6 + 2 


( 21 ) 


By equating the derivative to zero, the optimum power 

+ , 22 ) 

dP e 























i2 

It can easily be verified that the second derivative > 0 
for P = P*. m 

A. Remarks 

For a given finite value of transmit power P, from ( [Tfil l and 
we observe that, in all sensible cases, 

< r. (23) 

This indicates that BSs with caching capabilities always out¬ 
perform those without caching. Additionally, can be made 
smaller by increasing the catalog size fo in BSs. This is indeed 
intuitively correct, as shown in Fig. where some realistic 
power values from m are considered. 



Figure 2: APC vs. Transmit power with and without caching 
for values: Pg = 25W,Pd = 10W,/3 = 1, and a = 4.75 in 
@ and 

Fig. illustrates the variation of APC with respect to the 
transmit power in the cases with and without caching, for 
different values of pathloss exponent a. It can be noticed 
that the APC has a minimum for a certain power value only 
when q; > 4. However, APC can be significantly reduced with 
caching in all cases, and the performance gap between caching 
and no caching cases is increased for a increasing. 

IV. Energy Efficiency 

Another key performance metric that should be studied is 
the energy efficiency, which indicates the amount of utility 
(throughput) that is extracted out of a unit power invested 
for network operation. The standard QoS factor chosen as the 
utility in literature is the area spectral efficiency (ASE). In 
we define the EE as the ratio between ASE and the total 
power spent at the BS, i.e. 

£■ _ log(l + 'y)Pcov{P, Ab) 

Ptot 



Eigure 3: APC vs. Transmit power with and without caching 
for values: P, = 25W, Pj = lOW, /o = 10, A = 2 in ({T^ and 


<0. With caching, the expression for EE in therefore 
becomes 


AblogCl +7)^’c'Sv(l - 
(P, Xb) = - 


1 -' 


P + Ps+Pd/o 

In the case without caching, EE is given as 


£(P,Ab) = . 


Afelog(l + 7«(l-;^i.) 


P + Ps + Pd 


(25) 


(26) 


The following results are given for the maximization of EE 
and the discussions are carried out afterwards. 


Proposition 3. Suppose that the BSs have caching capabilities 
and let Pj* denote the optimal power allocation that maximizes 
S^‘=\P,Xb). Then 

Pi = 1 + \ll + Ps + Pdfo~^ ■ (27) 

Proof: We differentiate the expression ( |25] l with respect 
to P and solve for the optimum power P* 

^^=0 ^ 2P-P" + P, + Pd/o”^ =0 

^ pr = 1 + \/l + Ps + Pd/o”^- (28) 

It can be verified that the second derivative ^2 ^ is negative 
for the positive solution of P*. ■ 

Proposition 4. Suppose that the BSs have no caching capa¬ 
bilities and let Pf denote the optimal power allocation that 
maximizes £('=i(P,Ah). Then 

p; = l + s/^ + Ps + Pd- (29) 


In order to define more precisely the EE metric for both cases. Proof: Similar to ( |28l ), we can show that the optimum 

we use the expression for coverage probability established in power P 2 = 1 + ^/l + + Pd- It can be verified that the 





















































j‘2 p 

second derivative is negative for the positive solution of 

PI. ■ 

A. Remarks 

Based on the above results, we can make the following 
observations; 

1) For given positive values of Xf, and P, is 

always higher than £{P,Xb) making it apparent that 
implementing BSs with caching capabilities is an energy- 
efficient solution. 

2) For a fixed value of Xb, has a maximum at P* = 

1 -f \/l -f Ps -f Pd/o and £{P,Xb) has a maximum 
at P 2 = 1 -f \/l + + Pi- Noting that P* < P 2 *, we 

observe that the optimal EE may be attained at a smaller 
value of transmit power in the case of cache-enabled BSs. 

In Eig. 1^ we plot the variation of EE as a function of the 
transmit power P. It can be seen that EE can be significantly 
increased (in the case of caching) by increasing the size of the 
catalog in BSs, namely /q. 


• 10 “^ 



Eigure 4; EE vs. transmit power with and without caching 
for values: P^ = 25W, Pd = lOW, a = 4.75,/3 = 1,P^J^ = 
1, Xb = 0.5, Xu = 0.6, and 7 = 2 in ( |25l l and ( |26] l. 


V. Conclusions 

In this work, we studied how incorporating caching capa¬ 
bilities at the BSs affects the energy consumption in wireless 
cellular networks. Adopting a detailed BS power model and 
modeling the BS locations according to a PPP, we derived 
expressions for the APC and the EE, which are further sim¬ 
plified in the low noise regime. A key observation of this work 
is that cache-enabled BSs can significantly decrease the APC 
and improve the EE as compared to traditional BSs. We also 
observed that the existence of an optimum power consumption 
point for the APC depends on the pathloss exponent. 

The energy aspects and implications of caching in wireless 
cellular networks, especially for 5G systems, are of practical 
and timely interest and clearly require further investigation. 


Euture work may include heterogeneous network scenarios, 
including small cells, macro cells and WiEi access points 
deployment. Eurthermore, storing the popular content requires 
accurate estimation of content popularity distribution, which 
cannot be easily performed in practice and may cost energy 
in terms of processing power. Therefore, rather than relying 
on this approach, randomized caching policies in a stochastic 
scenario |20| can be considered as a means to provide crisp 
insights on the energy efficiency benefits of caching in dense 
wireless networks. 
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